Estimating uncertainty in respondent-driven sampling using a tree bootstrap method.

نویسندگان

  • Aaron J Baraff
  • Tyler H McCormick
  • Adrian E Raftery
چکیده

Respondent-driven sampling (RDS) is a network-based form of chain-referral sampling used to estimate attributes of populations that are difficult to access using standard survey tools. Although it has grown quickly in popularity since its introduction, the statistical properties of RDS estimates remain elusive. In particular, the sampling variability of these estimates has been shown to be much higher than previously acknowledged, and even methods designed to account for RDS result in misleadingly narrow confidence intervals. In this paper, we introduce a tree bootstrap method for estimating uncertainty in RDS estimates based on resampling recruitment trees. We use simulations from known social networks to show that the tree bootstrap method not only outperforms existing methods but also captures the high variability of RDS, even in extreme cases with high design effects. We also apply the method to data from injecting drug users in Ukraine. Unlike other methods, the tree bootstrap depends only on the structure of the sampled recruitment trees, not on the attributes being measured on the respondents, so correlations between attributes can be estimated as well as variability. Our results suggest that it is possible to accurately assess the high level of uncertainty inherent in RDS.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sample Size Calculations for Population Size Estimation Studies Using Multiplier Methods With Respondent-Driven Sampling Surveys

BACKGROUND While guidance exists for obtaining population size estimates using multiplier methods with respondent-driven sampling surveys, we lack specific guidance for making sample size decisions. OBJECTIVE To guide the design of multiplier method population size estimation studies using respondent-driven sampling surveys to reduce the random error around the estimate obtained. METHODS Th...

متن کامل

A Bootstrap Interval Robust Data Envelopment Analysis for Estimate Efficiency and Ranking Hospitals

Data envelopment analysis (DEA) is one of non-parametric methods for evaluating efficiency of each unit. Limited resources in healthcare economy is the main reason in measuring efficiency of hospitals. In this study, a bootstrap interval data envelopment analysis (BIRDEA) is proposed for measuring the efficiency of hospitals affiliated with the Hamedan University of Medical Sciences. The propos...

متن کامل

مدیریت و اجرای نمونه‌گیری از معتادان تزریقی در معرض بیماری‌های پرخطر

Objectives: Study of hidden populations such as Injection Drug Users (IDU) due to their exposing to high risk diseases and their roll in public health is very crucial. Conventional statistical methods for sampling these populations are not applicable because of the restrictions these populations are faced. Despite the introduction of respondent driven sampling as a successful method for samplin...

متن کامل

Statistical properties of bootstrap estimation of phylogenetic variability from nucleotide sequences. I. Four taxa with a molecular clock.

The statistical properties of sample estimation and bootstrap estimation of phylogenetic variability from a sample of nucleotide sequences are studied by using model trees of three taxa with an outgroup and by assuming a constant rate of nucleotide substitution. The maximum-parsimony method of tree reconstruction is used. An analytic formula is derived for estimating the sequence length that is...

متن کامل

نمونه‌گیری پاسخگو محور در مقایسه با سایر روش‌های نمونه‌گیری از جوامع پنهان

Sampling hidden populations is challenging due to the lack of convenience statistical frames. Since most populations exposed to special diseases are hidden and hard to reach, sampling methods that produce representative and efficient samples from the populations have become a study subject for researches all over the world. Because of the unknown probability of selecting samples in conventional...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 113 51  شماره 

صفحات  -

تاریخ انتشار 2016